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Choosing  the  Best  of  the  Current  Crop 
By 

Stephen  M.  Samuels  and  Gregory  Campbell 
1 .  INTRODUCTION  AND  SUMMARY 

This  paper  presents  a  new  version  of  the  so-called  "best  choice" 
problem  -  that  is,  the  problem  of  sequential  selection  of  the  best  one 
of  a  sequence  of  n  rankable  items  which  appear  in  random  order. 

Two  standard  versions  of  the  problem  are  the  "no-information" 
problem,  in  which  only  the  successive  relative  ranks  of  the  items  can 
be  observed,  and  the  "full  information"  problem  in  which  observations 
are  successive  values  in  a  random  sample  from  a  known  continuous  dis¬ 
tribution.  Optimal  stopping  rules  and  their  probabilities  of  success 
(i.e.  choosing  the  best)  are  contained  in  Gilbert  and  Mosteller  (1966). 

As  n  -*•  «>  the  optimal  probabilities  are  asymptotically  e"1  «  .37  and 
v*  fa  .58  in  the  two  problems. 

These  two  versions  are  not  only  inherently  oversimplified  models 
for  selection,  but  may  be  criticized  as  being  in  one  case  too  constrain¬ 
ing  (based  only  on  relative  ranks)  and  in  the  other  too  demanding  (the 
distribution  must  be  known  precisely). 

A  number  of  intermediate  versions  have  been  proposed,  which 
feature  some  form  of  partial  prior  knowledge  of  the  distribution. 
Petruccelli  (1978)  showed  that  if  the  distribution  is  known  to  be  normal, 
then  the  maximin  success  probability  is  asymptotically  v*;  while  if  it 
is  uniform  on  an  interval  of  known  length  the  limiting  probability  is 
strictly  between  e"1  and  v*.  However  for  the  class  of  all  uniform 
distributions,  Samuels  (1979)  showed  that  the  maximin  stopping  rule  is 
based  only  on  relative  ranks;  hence  the  limiting  success  probability 
Is  e"\  The  same  limit  applies  to  a  closely  related  Bayesian  problem 
of  Stewart  (1978).  Giving  the  end-points  (a,b)  of  the  uniform  distribution 
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a  conjugate  prior  density  of  the  form 

f(a,b)  =  m(m-l  )($-a)m"  V(b-a)m+^  -°°<a<a<e<b«» 

he  found  that  when  the  risk  is  1  -  .  P(item  selected  is  largest  in  the 
sample  and  >  3)  the  Bayes  stopping  rule  is  based  only  on  relative  ranks 
with  respect  to  3  as  well  as  to  preceding  observations. 

Campbell  (1977)  placed  a  Dirichlet  process  prior  on  the  space  of 
all  distributions.  In  the  situation  in  which  the  values  themselves 
are  observed,  this  leads  to  strategies  and  success  probabilities  that 
depend  only  on  the  nonatomic  measure  of  the  process.  In  the  event  that 
only  relative  ranks  are  available,  Campbell  (1978)  showed  the  optimal 
strategy  depends  only  on  the  mass  of  the  measure.  In  that  the  process 
selects  only  discrete  distributions,  ties  are  inevitable  and  the  mass 
of  the  measure  regulates  these.  The  improvement  over  the  "no-information" 
probability  vanishes  as  the  mass  tends  to  infinity. 

It  should  be  noted  that  to  achieve  the  "full  information"  success 
probability  it  suffices  to  observe  only  {F(X.)>  where  F  is  the  under¬ 
lying  continuous  distribution.  For  large  i,  F(X^)  is  well  approximated 
by  1"  where  is  the  rank  of  X^  among  X^,  ...,  X^.  This  suggests 
representing  prior  information  in  the  form  of  a  preliminary  or  train¬ 
ing  sample  of,  say,  size  m,  and  observations  as  relative  ranks  with 
respect  to  the  entire  training  sample  as  well  as  to  predecessors  in  the 
current  sample.  This  is  the  version  presented  here. 

As  an  application  of  this  version  one  may  consider  this  year's 
Ph.D.'s  In  Statistics  who  seek  academic  employment  as  the  current  crop, 
and  those  In  the  Interviewer's  previous  years  of  experience  as  the 
training  sample.  That  there  Is  some  oversimplification  here  Is  readily 
conceded. 


The  form  of  optimal  stopping  rules  for  this  problem  turns  out  to 
be  particularly  simple;  their  parameters  depend  on  m  (the  training 
sample  size)  and  n  (the  size  of  the  current  crop)  only  through  m  +  n. 
Specifically,  there  are  Integers  {Sk*(m+n)>  non-decreasing  in  k  such 
that  the  optimal  (m.n)-policy  is  to  stop  at  i  >  m  if  the  i-th  item  is 
the  best  so  far  of  the  current  crop  and  k-th  best  of  all  i,  provided 
that  1  >  Sk*(m+n). 

Not  only  are  those  parameters  readily  computable  from  (2.6)  and  (2.8) 
and  closely  approximated  from  (2.12),  but  also  the  limits  s^*  of 
Sk*(m+n)/(m+n)  exist  as  m+rv-**>,  are  themselves  computable  from  (3.14)  and 
can  serve  as  asymptotically  optimal  approximations  to  the  optimal  (m,n) 
parameters.  (See  Table  2). 

Although  the  optimal  success  probabilities  P*(m,n)  can  be  obtained 
from  the  algorithm  given  by  (2.5)  and  (2.6)  their  true  form  is  best 
revealed  by  the  following  asymptotic  results:  For  each  fixed  n,  as 
m  increases  from  0  to  ®,  P*(m,n)  Increases  from  the  "no- Information" 
optimal  success  probability  to  the  "full-information"  optimal  value  for 
sample  size  n,  which  is  denoted  by  vn*  (Theorem  4.1).  And  for 
m/(m+n)  -*•  t,  as  m  and  n  become  infinite,  P*(m,n)  p*(t),  the  function 

p*(*)  being  given  not  only  implicitly  as  the  limit  of  solutions  to  a 
system  of  piece-wise  differential  equations  (3.7)  but  explicitly  in 
Theorem  3.1  from  which  not  only  Is  numerical  evaulatlon  a  simple 
matter  (see  Table  4)  but  also  limt^p*(t)  *  v*  can  be  proved  (Theorem  3.3). 

The  sk*'s  and  p*(-)  are  introduced  in  section  3  as,  respectively,  the 

parameters  of  the  optimal  policies  and  the  optimal  success  probabilities 
for  an  infinite  version  of  the  problem  in  which  the  arrival  times  of  the 
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best  item,  second  best,  etc.  are  independent  random  variables,  each 
uniform  on  (0,1).  This  model  provides,  among  other  things,  a 
convenient  vehicle  for  showing  in  Theorem  4.2  that  the  difference 
between  P*(m,n)  and  p*(m/(m+n))  is  of  an  order  no  bigger  than 
(m+n)~^  log(m+n).  (Numerical  evidence  -  see  Table  4  -  suggests  that 
the  actual  order  is  (m+n)'^.) 

Since  we  know  of  no  way  to  deduce  the  explicit  form  of  p*(-) 
directly  from  the  differential  equations  (3.7),  it  is  in  fact  obtained 
as  a  special  case  of  the  explicit  formula  for  the  success  probability 
of  any  "plausible"  stopping  rule.  The  formula  is  given  in  Theorem 
3.2  which  is  proved  in  Section  5.  This  formula  also  facilitates 
evaluation  of  suboptimal  rules  of  simplified  form  anal agous  to  ones 
considered  in  Gilbert  and  Mosteller  (1966),  though  we  have  not  done  so. 


2.  THE  FINITE  PROBLEM 

The  problem  of  "choosing  the  best  of  the  current  crop  with 
maximal  probability"  may  be  formulated  as  follows: 

Let  Xj,  . ...  X|n,  Xm+j,  *m+n  be  a  random  permutation  of 
{1,  2,  ....  m+n};  i.e.  all  (m+n)!  permutations  are  equally  likely;  and 

let  Yj,  Y2,  ....  denote  the  corresponding  sequence  of  relative 
ranks;  i.e.  Y^  *  j  if  is  the  j-th  smallest  of  (X^,  ....  X^}.  Let 
j  *  y(m,n)  be  the  class  of  stopping  rules  t  based  on  the  Y^'s  and  P*(m,n) 
be  the  maximal  attainable  probability  of  selecting  the  smallest  of 

Xm+1*  Vn5  i,e” 

(2.1)  P*(m,n)  *  maxT£  j-  P(X  *  min  {X^:  m<i£m+n}). 


(We  write  "max"  rather  than  "sup"  because  standard  results  in  optimal 
stopping,  as  presented  in  Chow,  Robbins,  and  Siegmund  (1971),  insure 
that  the  supremum  is  attained.)  The  object  is  to  evaluate  P*(m,n)  and 
find  the  optimal  stopping  rule  which  attains  this  value. 

As  in  other  problems  of  optimal  stopping  based  on  relative  ranks, 
the  distributional  properties  of  the  X^'s  are  exploited,  notably  the 
fact  that  the  Y^’s  are  independent  with  each  Y.  uniformly  distributed 
on  tl,  2,  ...»  i}.  This  leads  to  two  expressions  which  greatly  simplify 
the  form  of  the  standard  backward  Induction  algorithm  for  the  optimal 
rule  and  its  expected  payoff,  as  described  in  Chow,  Robbins,  and 
Siegmond  (1971).  First  the  conditional  probability  of  success  for 
selecting  the  i-th  arrival  satisfies 

(2.2)  P(X.  =  min  {X^:  m<j<m+n}  |Yr  ....  Y^ 

i  m+n  k 

"  iJ/j-j+j  ^  I{Y.j=min{Yr:m<r<i}=k} 

1 

I{Y.=min{Yr:m<r£i}=k);  m<i<m+n 

where  (x)k  h  x(x-l)  ...  (x-k+1). 

Second,  defining 

(2.3)  Pk*(1;m+n)  h  maxT>i  P ( XT <^k  &  Xt  *  min  (X^ :  i<j<m+n})  l<k<_i<m+n, 

the  optimal  success  probability  with  stopping  ranks  which  don't  select 
any  of  the  first  1  arrivals  satisfies 

(2.4)  maxT>'j  p(X  *  min  {X^:  m<j<m+n>|  Yj . Y^) 

m+1 

*  J,  Pk*(i;  ro+n>  *(m1n{Yj :  m<j<1}  -  k) 


m<1<m+n. 
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It  can  then  be  shown  that 
,  m+1 

(2.5)  P*(m,n)  =  (m+1)"1  J  max{Pk*(m+l;m+n),  (m+1 )k/(m+n)kl 

m=0 ,  It  2,  ...  i  n~ 1 t  2,  •  •  • 

where  the  Pk*(i,  m+n)'s  are  computed  Iteratively  from 


(2.6a)  Pk*(i-1;  m+n)  =  (1-k/i)  Pk*(i ;  m+n) 

i  k 

+  1  l  max{P.*(i;  m+n),  (i)./(m+n).}  k<i<m+n 

j=l  J  J  J 


together  with  the  boundary  conditions 

(2.6b)  Pk*(m+n-l;  m+n)  =  k/(m+n)  k=l,  2 . m+n-1. 


Furthermore  the  optimal  policy  is  to  stop  at  the  first  i  >  m  (if  any) 
for  which 


and 

(2.7) 


Y.  ®  min  (Y,:  m<j<i} 

*  J 

Pv  *(i;  m+n)  <  (i)v  /(m+n)v  . 
Yi  ~  ri  Ti 


It  is  noteworthy  that  (2.7)  depends  on  m  and  n  only  through 
m+n.  This  means  that  all  of  the  optimal  policies  for  a  given  value 
of  m+n  combine  the  obvious  distinct  prescriptions,  "don't  stop  If 
the  present  arrival  is  not  best  so  far  of  the  current  crop  ...",  with 
the  common  criterion,  "...  and,  If  it  Is  best  so  far  of  the  current 
crop,  and  k-th  best  among  all  1  seen  so  far,  stop  if  and  only  if 


1  6  Ik(m+n)  c  {1,2,...,  m+n}." 
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What  is  the  structure  of  the  sets  Ik(m+n)?  Since  the  right  side 
of  (2.7)  is  decreasing  in  Y..  while,  by  its  very  definition,  the  left 
side  is  increasing  in  Y..  it  follows  that 

Ik(m+n)  3  Ik+1(m+n). 

To  show  that  each  Ik(m+n)  is  actually  an  interval  of  the  form 

Ik(m+n)  =  {i:  Sk*(m+n)  <_  i  <  m+n} 

—  and,  necessarily, 

(2.8)  Sk*(m+n)  =  min  {i:  Pk*(i ;  m+n)  <  ( i )k/(m+n)k> 

—  it  would  suffice  to  establish  that  each.Pk*(i;  m+n)  is  decreasing 
in  i.  While  this  is  not  true,  the  change  of  variables 

(2.9)  Gk(i;  m+n)  =  Pk*(i;  m+n)/[(i)k/(m+n)k] 

accomplishes  the  same  end  because  then  (2.6)  becomes 

(2.10a)  Gk(i-1;  m+n)  =  Gk( i ;  m+n) 

_i  k  (m+n-j).  . 

+  (i-k)  l  [Tl.  -;7K  J]  max{G.(i ;  m+n),  1} 
j*l  J 

k<i<m+n, 

(2.10b)  Gk(m+n-l;  m+n)  =  k/(m+n-k), 

which  shows  that  Gk(k;  m+n)  is  decreasing  in  i.  Since  (2.7)  is  equiva¬ 
lent  to 

Gv  (i;  m+n)  <  1 
Ti 


this  proves 
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Proposition  2.1.  The  optimal  stopping  rule  in.  the  finite 
problem  is 

(2.11)  t*(m,n)  *  min  {i>m:  Y.  =  min{Y.:  m<j<i}  &  i  >.  Sy  *(m+n)} 

i  J  '  j 

=  m+n  if  no  such  i<m+n 

where  Sk*(m+n)  is^  given  (2.8)  and  is  non-decreasing  in  k. 

The  parameters  {Sk*(m+n)}  can  easily  be  computed  from  (2.6)  and 
(2.8).  Some  values  are  given  in  Table  2.  In  addition  they  may  be 
estimated  from  the  inequalities 

(2.12)  (1-  k-1 )(m+n)  <  Sk*(m+n)  <  ( .5)1/k(m+n)  +  k 

the  derivation  of  which  is  similar  to  (but  somewhat  less  tidy  than) 
that  for  (3.15)  in  the  next  section. 

3.  THE  INFINITE  PROBLEM 

As  in  Gianini  and  Samuels  (1976), let  the  arrival  times 
U.|,  Ug,  ...»  of  the  best,  second  best,  etc.,  be  I  ID,  each  uniformly 
distributed  on  (0,1).  This  is  motivated  by  the  fact  that  in  the 
finite  problem  not  only  the  ranks  of  successive  arrivals,  but  also 
the  arrival  positions  of  the  best,  second  best,  etc.,  form  a 
random  permutation  of  (1,  2,  ....  m+n). 

For  each  t  €  (0,1),  let  Z^(t).  Z2U),  ...,  be  the  arrival  times 
of  the  best,  second  best,  etc.  among  those  which  have  arrived  by 
time  t;  3(t)  be  the  o-fleld  generated  by  (Z^t):  i-1,  2,  ...};  and, 
noting  that  s  <  t  Implies  3(s)  c  3(t),  j  be  the  class  of  all 
(possibly  defec'*ve)  stopp  ng  rules  t  adapted  to  {3(t):  0<t<l}  with  the 
property  that  t  6  ''  .-0,  2,  ...}  on  the  set  where  t  is  defined. 
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In  this  framework  the  problem  of  interest  can  be  described  as 
follows.  For  any  t  €  [0,1)  define 

(3.1a)  K.|(t)  s  min{j:  U^>t} 

(3.1b)  Ki+.|(t)  =  min{j>K. :  Uj>t}  i=l,  2,  ... 

(so  K-|(t),  ^(t),  ...  are  the  (unobservable)  absolute  ranks  of  the 
best,  second  best,  etc.  post-t  arrivals),  and  let 

(3.2)  Ti (t)  =  UK  1-1,  2,  ... 

(so  T^t)  is  the  arrival  time  of  the  i-th  best  post-t  arrival).  The 
object  is  to  evaluate 

(3.3)  p*(t)  =  supTejr  P(T=I1(t>) 

and  to  find  for  each  t  a  stopping  rule  T*(t)  which  attains  this  maxi¬ 
mal  probability  of  selecting  the  best  post-t  arrival. 

To  do  so  it  is  helpful  to  consider  quantities  analagous  to  the 
Pk*(i;  m+n)'s  in  the  finite  problem,  namely 

(3.4)  pk*(t)  5  sup Tgjr  PU-T^t)  &  K^tJik)  k  =  1,  2,  ... 

and  the  associated  optimal  stopping  rules  tk*(t).  Oust  as  in  the 
fin.ite  problem,  p*(t)  and  T*(t)  are  obtained  from  the  Pk*(t)'s  and 
Tk*(t)'s.  It  is  evident  from  their  definition  that  Pk*U)  is 
increasing  In  k;  also,  for  any  xtj, 

(3.5)  P(T  -T^t))  -  P(T*T1(t)  &  K.| (t)  <  k)  <  P(K](t)>k)  =  tk. 
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Hence 

(3.6)  p*(t)  -  tk  <  pk*(t)  <  p*(t)  k=l,  2,  ....  0<t<l. 

For  the  problem  of  finding  (3.4),  the  conditional  success  proba¬ 
bility  for  stopping  at  a  particular  arrival  time,  say  u,  given  3(u), 
is 

k  J 

U  I(Zi(u)<t,  i<j ;  Zj(u)=u} 

if  u  >  t;,  zero  otherwise.  Since  this  is  not  a  function  only  of  u 
and  the  relative  rank  at  u  of  the  current  arrival,  the  standard  results 
of  Mucci  (1973a  and  b),  Gianini  and  Samuels  (1976)  and  Lorenzen  (1977) 
do  not  apply.  Nevertheless,  identical  arguments  lead  to  the  following 
system  of  differential  equations: 

j  i  k  . 

(3.7a)  jjr  Pk*(0  *  t"  [kpk*(t)  -  ^  max{pj*(t) ,  tJ}] 

k=l,  2,  0<t<l 

with  the  boundary  conditions 
(3.7b)  pk*(l-)  =  0. 

This  formula  is  strongly  suggested  by  letting  the  arrival  times  in 

the  finite  problem  be  {k/(m+n):  k=l,  2,  ...,  m+n}  and  rewriting  (2.6a)  as 

P  *(i;m+n)  -  P  *(i-l;m+n)  , 

- ylm+n) - -  CV(m+n)3"  [kPk*(i;  m+n) 

k 

-  I  max{P.*(i ;  m+n),  (i ) ./(m+n) .}]. 
j-1  J  J  J 

From  the  form  of  (3.7)  it  follows  that  P|,.*(t)  is  attained  by  the 
stopping  rule 
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(3.8)  xk*(t)  =  min{U^>t:  for  some  r<k,Zj(U..)<t  for  j<r, 

Zr(U.)  =  U.,  and  U .r>pr*(U1 ) } 


=  (undefined)  if  no  such  Ui ; 


also 

(3.9)  t*(t)  =  lim^  xk*(t). 


Just  as  in  the  finite  problem,  a  close  examination  of  (3.7)  leads  to 
explicit  expressions  for  these  optimal  rules.  The  bonus  here  is  that 
the  solution  to  (3.7)  itself  is  also  obtained.  Making  the  change  of 
variables 

gk(t)  =  t"kpk*(t) 
transforms  (3.7)  to 

(3.10a)  gk'(t )«-j  t'(k+1'j)max{g.(t),  1} 

k=l ,  2,  ...;  0<t<l 


(3.10b)  gk(l')  =  0. 

Thus  the  gk(*)'s  are  strictly  decreasing  and  unbounded  in  (0,  1)  so 
one  can  define 


(3.11)  sk*:  9k(sk*)  =  1  k»l,  2 . 

From  (3.10)  it  is  clear  that  for  each  t  in  (0,  1),  9k(t)  is  increasing 
in  k;  hence  so  are  the  sk*'s.  Now,  on  [sk*,  1],  (3.10)  is  simply 

gk'(t)  =  -  \  t"j 
k  J=1 

gk(i‘)  -  o. 


Hence 
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k-1  ,  • 

(3.12)  gk(t)  ■  |ln  t|  ♦  l  J  (fJ-l)I{k>))  sk*it<l. 

J“1 

Thus  a  complete  description  of  the  optimal  stopping  rules  i*(t)  can 
be  suimnarized  by 

Proposition  3.1.  In  the  infinite  problem*  there  is  a^  single 
increasing  sequence  { s :  k=l ,  2,  . . . }  such  that,  for  any  t  in  [0.1 )» 

(3.13)  T*(t)  =  min  {U^ >t :  for  some  r,Zj(U1.)<t  for  j<r,  Zr(Ui )  =  U.>sr*} 

=  (undefined)  if  no  such  U.. 

The  sequence  { sk*>  is.  given  by 
(3.14a)  =  e"1 

k-1  j 

(3.14b)  | In  s  *|  +  l  j  [(s  *)  J-l]  =  1  k=2,  3 . 

K  j=l 

Corollary:  For  any  t  €  [0,1)  and  k  =  1,  2,  ...  the  value 
Pk*(t)  is  attained  by  (3. 13)  with  sr*  replaced  by  one  for  each  r  >  k. 

Some  values  of  sk*  are  given  in  Table  1  together  with  the  upper 
and  lower  bounds  provided  by  the  following 
Proposition  3.2.  For  each  k=l ,  2,  ... 

(3.15)  1-  k_1  <  sk*  <  (l/2)1/k. 

Proof:  On  the  one  hand,  Pk*(sk*)  =  (sk*)k.  On  the  other  hand, 
taking  t  *  sk*  In  (3.4), 

(3.16)  P(K1(s|c*)  <  k+1  <  K2(sk*))  1  Pk*(sk*)  <  P*(K1(sk*)  <  k). 
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The  second  Inequality  is  immediate;  the  first  holds  because  the  first 
probability  is  smaller  than  the  success  probability  for  the  rule 
"select  the  first  post-s^*  arrival  with  relative  rank  <_  k".  The  left 
and  right  sides  of  (3.16)  are  just  lc(l-sk*)(sk*)k  and  l-(sk*)k  respec¬ 
tively.  The  result  then  follows  by  substitution-  □ 

Remark:  The  bounds  in  (3.15)  are  the  limits  obtained  by  dividing 
the  left  and  right  sides  of  (2.12)  by  m+n  and  letting  m+n  -*  °°. 

As  for  the  optimal  probabilities,  pk*(t),  these  can  be  obtained 
iteratively  from  (3.10)  and  (3.11)  since  (3.10)  is  given  by  (3.12) 
on  tsk*,l)  and  can  be  transformed  to 

k-1 

(3.17)  df  (t  9k(t))  =  -  £  f(k-j)  max{gj  (t).l>  I{k>1} 

3  1 

on  (0,  sk*).  While  one  could  certainly  contemplate  numerical  evalu¬ 
ation  of  the  pk*(t)'s  via  (3.17),  this  is  unnecessary,  for  we  have  -- 
by  indirect  means -- arrived  at  an  explicit  expression  for  the  solution 
to  (3.7),  namely  the  following: 

Theorem  3.1.  Let 

(3.18a)  r*(t)  =  max{j:  s.j*<t} 

*  0  if  t<s,*  0<t<l 

(3.18b)  rk*(t)  =  min  (r*(t),k)  k*l,  2,  ... 

Then  p*(t)  =  limk^copk*(t)  where  (with  r=rfc*( t) ) 
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r  r-1  ,  . 

(3.19)  pk*(t)  «  tr[|ln  t|  l{ri„  +  I  3  '(t  J-l)  l(ri2) 

j 

k  3  * 

+  I  tJhi*(s.+1)  0<t<l, 

j=r  J  J 

and 

(3.20a)  h0*(Sl*)  =  e'1 

(3.20b)  hk*(sk+l}  =  k’1[(sk+l*)‘k“^  "  {1-sk+l*} 

k«l,  2,  ... 

Remark:  From  the  first  inequality  of  (3.15)  we  conclude  that 


hk*(sk+r}  <  (e'1)/k  <  2k_1 


and  hence 


P*(t) 


l 

aK+l 


‘V(Vi*> 


2tK+1/k(l-t) 


K=l,  2, 


0<t<l . 

This  inequality  facilitates  computation  of  p*(t)  to  any  desired  degree 
of  accuracy  and  was  used  in  preparing  Table  3. 

It  is  straightforward  to  verify  that  (3.19)  is  a  solution  to 
(3.7)  which,  strictly  speaking,  proves  the  theorem.  But,  without 
some  indication  of  how  (3.19)  might  be  deduced  directly  from  (3.7),  this 
Is  hardly  satisfactory.  And  we  know  of  no  such  deductive  method. 

Our  method  of  obtaining  (3.19)  begins  by  considering  the  class  of 
all  families  of  stopping  rules  of  the  form 


(3.21)  x (t ;  s)  s  min  {U^>t:  Zj(U^)<t  for  j<r,  Zr(U^)  s  U^>sr} 
*  (undefined)  if  no  such 
where  £  ■  (s^,  Sg,  ...)  is  any  sequence  with 
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(3.22)  0  <  s-j  £  Sg  £  . . .  £  1  • 

The  optimal  stopping  rules  are  of  this  form;  namely 
T*(t)  =  t ( t ;  s*) 

Tk*(t)  =  T(t;  s^*) 

where 

s*  =  (Sl*  s2*.  ...) 

V  =  ^si**  •••»  sk*’  ^ 

For  any  £  satisfying  (3.22),  denote 

$k  —  (s^,  ...,  Sj^i  1,  1,  ...) 

and  also 

(3.23)  r(t;  s)  =  sup  {j:  Sj<t}  I{t>$  j 

=  «d  if  sup  sfc£t 

(3.24a)  p(t;  s)  =  P(t(t;  s)  =  T^t)), 

(3.24b)  ^(t;  s)  =  p(t;  sk). 

Then  it  can  be  shown  that  on  (0,  sup  sk)  -  {s ^ ,  with  r  =  r(t;  s^), 

(3.25)  dl  pk(t;  i)  =  t‘1tk  pk{t;  £>  '  l  tJl{r>l> 

j=l 

k 

l  P,(t;  s)  I,  <k}] 
j=rH  J  lr 

k=l ,  2,  ... 

which  has  a  unique  system  of  continuous  solutions  satisfying  the  boun¬ 
dary  conditions  Pk(sup  sk;  s)  =  0.  That  solution  Is  given  by  the 
following  : 

Theorem  3.2.  For  any  ^  satisfying  (3.22)  and  for  t  <  sup  sk. 


t 
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r-1 


(3.26)  P(t;  s)  =  tr[|1n  t|l{rll}+  I  (fJ-l ) I lr^2}] 

3  ^ 

00 

k=r  1  Msk+l} 
where  r  h  r(t;  s.)  <  ®  and 


(3.27a) 

(3.27b) 


^0^S1 ^  =  S1  I S1 I 

hk(sk+1}  =  k’1^st4.i)’k-13 


k+1 
’k+1 


0-st+1){!ln  sk+1|  +  l  j'1[(sk+1)'J-l]} 

3  1 

k=l ,  2, 


In  particular  pk(t;  s)  is  given  by  the  right  side  of  (3.28)  with 
r  «  r(t;  sk). 

As  with  (3.7)  and  (3.19),  (3.25)  can  be  verified  in  a  straight¬ 
forward  manner  for  pk(t;  s_)  as  given  above,  but  the  latter  cannot  be 
directly  deduced  from  the  former.  A  detailed  proof  of  Theorem  3.2 
without  using  (3.25)  is  deferred  to  section  5,  while  the  relationship 
of  Theorem  3.1  to  Theorem  3.2  will  be  presented  here. 

First,  the  two  theorems  are  consistent  because,  for  each  k, 

Vsk+1*)  =  hk**sk+l*)  and  V1)  *  0;  hence  pk*t;  V*)  =  pk*^* 

Second,  Theorem  3.2  has  the 

Corollary;  For  each  t  in  (0,1),  p(t;  %)  and  (pk(t;  sj,  k* 1 ,  2,  . . . ,} 
are  all  maximized  ^  =  s*. 

Remark:  This  corollary  states  only  that  x(t;  s*)  is  optimal  in 
the  subclass  of  all  rules  of  the  form  (3.21).  Its  optimality  in 
the  class  of  all  stopping  rules  is  asserted  by  Proposition  3.1.  Thus 
it  is  the  combination  of  two  methods  which  yields  the  solution  to  the 
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Infinite  problem.  One  Is  the  so-called  "method  of  backward  Induc¬ 
tion"  which  leads  to  (3.7);  the  other  Is  the  probabilistic  "conditioning" 
argument  used  in  section  5. 

Proof  of  Corollary:  Strictly  speaking,  no  proof  is  needed  since 
the  corollary  follows  immediately  from  Proposition  3.1  and  its  corol¬ 
lary.  So  we  shall  merely  indicate  briefly  how  it  follows  directly  from 
(3.26)  and  (3.27).  There  are  two  key  facts  to  be  checked.  First  , 

each  h. (x)  is  concave  in  (0,1)  and  maximized  at  s,  *,  so 
K  k+1 


r(t;  s' ) 


r(t;  s)  &  0  <  (s'k-sk*)/(sk-sk*)  <  1 

k=l ,  2,  ... 


=>  p(t;  )  >_  p(t;  s_) 


Second,  if  s  and  s'  are  such  that  sfc  *  sk‘  whenever  sfc  >  t  or  sk'  >  t, 
then 

P(t;  s')  =  p(t;  s)  . 


From  just  these  two  facts  it  follows  that  any  can  be  replaced  by 
without  decreasing  p( t ;  sj.  □ 

We  return  now  to  Theorem  3.1  to  investigate  the  optimal  best  choice 
probability  p*(t)  for  t  =  0  and  as  t  *  1.  The  former  case  is 
immediate: 


P*(0)  =  hQ*(s1*)  =  e'1; 

this  is  the  familiar  limiting  no-information  value.  As  t  -*>  1  one  would 
expect  p*(t)  to  converge  to  the  limiting  full-information  value  v*. 

The  following  expression  for  v*  was  obtained  by  Samuels  (1980): 


(3.28)  v*  s  e”c  ♦  (ec-c-l)  /"  x"VCxdx 
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where  c  «  .804352  satisfies 

oo 

(3.29)  l  cj/j!j  =  1. 

From  the  formula 

IT  x'Vcxdx  =  | log  c|  -  Y  -  I  (-c)j/j!j 

j=l 

where  y  Is  Euler's  constant  (y  .577216),  v*  can  easily  be  evaluated. 
The  value  is  v*  »  .580164,  in  agreement  with  that  in  Gilbert  and 
Mosteller  (1966). 

Theorem  3.3.  The  following  limits  hold; 


(3.30) 

p*(t)  =  v* 

(3.31) 

11m. 

k-*» 

k(l-sk*)  *  c 

(3.32) 

(l-t)r*(t)  =  c 

where  v*  and  c  are  given  by  (3.28)  and  (3.29)  respectively. 

Proof  of  Theorem;  To  establish  (3.31),  write 

(3.33)  sk*  *  k/(k+ck) 
and  use  the  identity 

l  rVJ-n  *  i 

j-i  j.i  •> 

with  x  *  sk*.  It  follows  from  (3.14b)  that 

l  r7({)(ck)Vj  -  k*,C(l+k"1ck)k-l] 

j«l  3  K  * 

+ln  (1+k  ck)  ■  1. 
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Now,  by  the  left  side  of  (3.15),  ck  <  k/(k-l),so  the  second  and  third 
terms  on  the  left  side  go  to  zero  with  k.  Thus, 

lim^  .f  1(j)(ck)'V'}  =  1. 

Writing  the  sum  as 


\  [(c)5  V  ( 1 -k” 1 i )]/j ! j 

j=l  K  i=l 

the  dominated  convergence  theorem  can  be  employed  to  show  that  for  any 
convergent  subsequence  c.^  -*  c,  c  must  satisfy  (3.29).  Thus  c^  -*•  c 
and,  since  s^*  -*■  1  by  (3.15),  (3.31)  holds. 


Formula  (3.32)  is  obtained  from  (3.31)  by  noting  that  r*(t)  *  k 
if  sk*  <  t  <  s*k+i«  and  by  using  (3.33),  which  gives 

sk*  £  t  <  s*k+1  «*  t  ck+1  -  (1-t)  <  (l-t)k  <  t  ck. 

Note  that  (3.32)  implies 

(3.34)  limt+1  tr*^  •  e’c  . 


Also,  substituting  (3.33)  into  (3.20b)  gives 
(3.35)  h*.,(sk*)  -  ± 

Thus, 


(3.36) 


k=r*(t) 


h,*(s* 
k  k+1 


)  w  (ec-c-l)  l 
k*r* 


k»r*(t) 


k-v. 


f 
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The  remainder  of  the  proof  is  based  on  the  following  two  calculus 
lemmas  which  are  straightforward  to  prove. 

Lemma  1:  If  n{t)  ■»  as  t  increases  to  1,  then 


I  ,  k''tk  -  rnM  x'Vd*  =  n  y-'tn(t)ydy; 

k>n(t)  1 

hence  if  n(t)  In  t -  b  as  t  1 ,  then 

l  k-l,k  -  f  y-VbXd*‘ 

kin(t)  k  *  1  x  e  dx 

Lemma  2:  ^f  n(t)  +  »as  t  increases  to  1  but  n(t)  In  t  =  0(1), 


,<k|n(t)  k''(t-k-l)  «/^t>x‘1(fx-l)<lx. 

Hence,  if_  n(t)  In  t-*-  -  b  as  t  -*■  1,  then 

I  k-'(t-k-l)*./b  y-’(t-"(t)>/b-1)dy 

lj<k<n(t)  b/n(t) 

^/Sy-V-Udy-j^. 


From  (3.34)  observe  that  r*(t)  and  r*(t)  -  1  satisfy  all  of  the 
conditions  on  n(t)  in  both  Lemmas,  with  b  =  c.  Thus  from  Lemma  2, 
with  n(t)  *  r*(t)  -  1,  and  from  (3.34) 

tr*(t)  1  j"1 (t"j-l )  +  e“c  l  cj/j!j  =  e"c 

j*l  j=l 


and  from  Lemma  1  with  n(t)  *  r*(t),  together  with  (3.34), 

I  .  tJb1*<sttl*>  *  (ec-c-1)  /'  x'Vcxdx  . 
j«r*(t)  J  J  '  1 
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The  only  remaining  term  in  (3.19)  is  tr  ^  |ln  t|  which  goes  to  zero. 
This  completes  the  proof  of  Theorem  3.3.  □ 

4.  ASYMPTOTIC  RESULTS 

The  following  results  will  be  proved  in  this  section: 

Theorem  4.1.  For  each  fixed  n,  P*(m,n)  increases  to  v*(n)  as 
m  -*-  <*>. 

Theorem  4.2.  As  m  +  n  -*■  «, 

|P*(m,n)  -  p*(m/(m+n))j  =  0  ((m+n)-*  log(m+n)) 

uniformly  rn  e  <  m/(m+n)  <  1  -  e  for  any  e  >  0. 

Corollary.  As  m  +  n 

(m+n)"1  Sk*(m+n)  -►  s^*  k=l,  2,  ...  . 

Since  v*(n)  is  the  full -information  success  probability.  Theorem 
4.1  confirms  the  fact  that  a  large  prior  sample  size  provides  enough 
information  to  enable  one  to  choose  the  best  of  a  current  crop  of  size 
n  with  probability  nearly  as  high  as  if  sampling  from  a  known  contin¬ 
uous  distribution. 

Theorem  4.2  and  its  corollary  show  that  the  infinite  problem 
success  probabilities  and  optimal  stopping  rules  can  serve  as 
approximations  to  the  finite  problem  ones.  Also,  it  can  easily  be 
checked  that  the  finite-problem  policies  Sk(m+n)  =  [(m+n)$k*]  are 
asymptotically  optimal. 

Proof  of  Theorem  4.1.  Let  {Z^:  l<j<m+n>  be  Independent  random 
variables  each  uniform  on  (0,1);  -tX^:  l<i<m+n)  be  the  ranks  of  the 
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Z^'s,  (Y^.:  l<i<m+n}  be  the  relative  ranks,  and,  if  m>l,  {Y. ' :  2<^i<m+n} 
be  the  relative  ranks  of  {Z^:  2<i<m+n).  Let  J"(m+n)  be  the  class  of 
all  stopping  rules  adapted  to  {Y^}. 

Each  Y.'  is  a  function  of  (Y . :  jo},  so  y(m+n)  contains  all 

*  y 

stopping  rules  adapted  to  {Y^'}.  But  the  distribution  of 
(Y.. '  :2<i<m+n}  is  the  same  as  that  of  {Y..:  lo_<(m-l  }+n}.  Thus  the 
optimal  success  probability  in  this  subclass  of  y(m+n)  —  when  m>l  --  is 
precisely  P*(m-l,n),  which  is  necessarily  no  greater  than  P*(m,n).  This 
proves  monotonicity  of  P*(m,n)  in  m. 

Since  each  Y.  is  a  function  of  {Z.:  j^i),  it  follows  that 

*  J 

Yn*  1.  P*(m»n)  for  each  m  and  n.  Hence,  to  complete  the  proof,  it 
suffices  to  show  that  lim  inf,  P*(m,n)  >  v  *•  This  is  accomplished 
by  exhibiting  a  sequence  of  stopping  rules,  Tm>  for  which  p(tmsTm*)  -*  1 
where  xm*  is  the  optimal  rule  based  on  {Z..:  l<i<m+n}  which,  as  Gilbert 
and  Mosteller  (1966)  showed,  is  of  the  form 


We  can  then  let 


•t„  =  min  {i:  m 
m 


Vi  -bn-i}’ 


which  is  satisfactory  because,  as  ji  +  •,  {m_1  Ym+^:  l<i<n}  converts 
in  distribution  to  {Zm+^ :l<i<n}.  □ 

The  proof  of  Theorem  4.2  is  based  on  the  following  propositions: 
Proposition  4.1:  For  any  t  >  m/(m+n). 


P*(m,n)  >  p*(m/(m+n))  -  [1  -  E(X/n )I{X<n>3 
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where  X  _U  a^  binomial  random  variable  with  parameters  m+n  and  1-t. 

Proposition  4.2:  For  any  t  £m/(m+n), 

p*(m/(m+n))  >_  E(n/Y)I{y>n}P*(m,n) 

where  V  is  a  negative  binomial  random  variable  with  parameters  m+1 
and  t  (i.e.,Y  has  the  distribution  of  the  number  of  failures  before 
the  (m+l)-th  success). 

Before  proving  these  two  propositions  let  us  show  how  the  theorem 
follows  from  them  by  a  slight  adaptation  of  a  familiar  large  deviation 
result  for  binomial  distributions. 

The  method  in  Feller  (1968;  p.  193)  yields  this 
Lemma:  If  Z  Is  binomial  (N,p),  then,  for  any  e  >  0, 

supe<p<l-e  P<IZ-EZI  >  N *  *  0(N^/1og  N). 

To  use  this  lemma  with  Proposition  4.1,  choose  t  =  m/(m+n)+  (m+n)"'2  log  (m+n) 

1 

so  EX  =  n  -  (m+n)2  log(m+n).  Then 

E(X/n)I^x<nj  >_  [l-2n"1(m+n)2log(m+n)]  •  P(n-2(m+n)"log(m+n)<X<n) 

=  [l-2(l+mn”1)(m+n)”2log(m+n)]  •  P(  |X-EX|Wm+n  log(m+n)). 

Now  (1+mn'1)  is  uniformly  bounded  if,  for  some  e  >  0,  e  <  m/(m+n)  <  1  -  e; 
so,  for  t  as  given  above, 

(4J)  supe<m/ (m+n)<l-e  n-E(X/n)I{Xin}]  =  0((m+n)^log(m+n)). 

The  application  of  the  lemma  to  Proposition  4.2  is  similar.  Choose 
t  =  m/(m+n)  -  (m+n)"^  log (m+n).  Then  for  any  c  >  0 
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E(n/Y)I{Y>n}  >  [1-2  cn_1(m+n)2  log(m+n)]  •  P(n<Y<n+2c(m+n)^  1og(m+n)) 

*  [l-2c(1+mn”^ )(m+n)”2  1og(m+n)]  •  [P(V>m)-P(U>m)] 

where  U  and  V  are  binomial  random  variables  with  parameter  m+n  and  t, 

1 

and  the  integer  part  of  [m+n+2c (m+n )5  log(m+n)]  and  t,  respectively.  Then 
EU  =  m  -  (m+n)2  log(m+n),  and,  taking  c  *  t~\  EV  =  m+(m+n)2  log(m+n)-6 
where  6  <  1  and  can  be  neglected.  Hence 

P(V>m)  -  P(U>m)  *  P(V-EV^-(m+n)2  log(m+n))  -  P(U-EU>_(m+n)2  log(m+n)). 

Since  c(l+mn’^)  is  uniformly  bounded  if,  for  some  e  >  0, e  <  m/(m+n)<l-e, 
conclude  that 

<4-2)  su’)e<m/(m+n)<l-€  0-E<n/m(y,n)]  -  log(arHi))  • 

The  theorem  now  follows  immediately  from  (4.1),  (4.2),  and  the 
two  propositions. 

Proof  of  Proposition  4.1:  Modify  the  infinite  model  of  section  3 
by  augmenting  the  o-fields  3t  to  3t  v  3q  where  3Q  is  generated  by  the 

order  statistics,  u^)’  •••*  u(m+n)  U2’  Um+n’  the 

arrival  times  of  the  m+n  best.  Denote  by  J  the  augmented  class  of 
stopping  rules  which  includes  all  those  adopted  to{3t  v  3Q}.  Letting 

I  =  min  {i:  U..  =  for  some  j  >  m}, 

Uj  is  the  arrival  time  of  the  best  of  the  last  n  to  arrive  among  the 
m+n  best. 

Clearly 

supt€j  P(t*Uj)  =  P*(m,n) 
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which  is  attained  by  the  rule,  say  x(m,n),  which  corresponds  to 
T*(m,n).  Hence,  in  particular,  for  any  t  >_  m/(m+n) 

(4.3)  P*(m,n)  =  P(7(m,n)=Uj) 

>  P(T*(t)=Uj) 

>  P(i*(t)=Z1(t))  -  P(Z(t)f*Uj) 
s  p*(t)  -  P(Z1 (t)^Uj) 

1  p*(m/(m+n))  -  P(Z1 (tJ^Uj ) . 

Let  X  ■  max{j:  u(m+n_j)  >  t}.  Then  X  has  a  binomial  distribution 
with  parameters  m+n  and  1-t, 

(4.4)  {X  <  n  &  Uj  >  t}  c  {Z1 (t)  =  Uj}, 
and 

(4.5)  P(X  <  n  &  Ur  >  t)  =  E(X/n)I{X<nj. 

The  proposition  follows  immediately  from  (4.3),  (4.4),  and  (4.5).  □ 

Proof  of  Proposition  4.2:  Returning  to  the  unaugmented  infinite 
model  of  section  3,  for  fixed  m,  n,  and  t  let  Y  be  the  number  of 
post-t  arrivals  which  are  better  than  the  (m+l)-th  best  pre-t  arrival. 

Then  Y  has  the  negative  binomial  distribution  with  parameters  m+1  and  t. 

Now  let  R^,  R2»  ...»  Ry  denote  the  successive  relative  ranks  of 
these  Y  arrivals.  On  the  event  {Y>n},  R^,  ...»  Rn  have  the  same  joint 
distribution  as  the  last  n  relative  ranks  in  the  finite  (m,n)  problem; 
namely, they  are  independent  with  R^  uniformly  distributed  on  {1,  2,  ....  m+1 
Moreover  the  R^'s  are  "observable",  so  there  is  a  stopping  rule  t  in  I 
which  on  Y>n  is  based  on  R^,  ....  Rn  In  exactly  the  same  way  as  the 
optimal  (m,n)-po!1cy  Is  based  on  the  last  n  relative  ranks  in  the 
(m.n)-problem. 
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Note  that  on  the  event  (Y>n}  the  best  of  the  first  n  of  the  Y 
arrivals  is  the  best  post-t  arrival  if  and  only  if  the  best  of  all  Y 
is  among  the  first  n  of  them.  Hence,  if  t<m/(m+n), 

(4.6)  p*(m/(m+n))  >  p*(t)  >_  P(t  selects  best  post-t  arrival) 

>_  P(Y>n  &  best  of  all  Y  is  among  first  n)  P(x  selects  best  of 
first  n  |Y>n  &  best  of  Y  is  among  first  n). 

The  first  factor  on  the  right  side  of  (4.6)  is  E (n/Y ) I ^ Y>n }  because 
whatever  the  value  of  Y  the  ranks  of  the  Y  arrivals  relative  to  each 
other,  are  just  a  random  permutation  of  {1,  ....  Y>.  The  second  factor 

is  P*(m,n)  because,  given  Y>n,  the  distribution  of  R,,  ....  Rp  does 
not  depend  on  whether  or  not  the  best  of  all  Y  arrivals  is  among  the 
first  n.  Substituting  these  values  onto  (4.6)  completes  the  proof.  □ 

5.  PROOF  OF  THEOREM  3.2 

Let  t  6  (0,1)  and  s  =  (s1 ,  Sg,  . . .),  with  0  <  s^  <  s2  <  ....  be 
fixed.  The  abbreviations 

t  s  i(t;  s) 

Ki  =  K.(t) 

T,  .  Vt> 
r  s  r(t;  s) 

will  be  used  throughout  the  proof. 

Begin  by  partitioning  the  event  Ct-ST^}  according  to  the  value  of  the 
smallest  i  for  which  t  <  T^  <  T^;  so 

(5.1)  Pfr-T,)  -  J2  P^-T,  i  Tj>. 
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Note  that  on  the  event  (T^min^  T^}  the  relative  rank  of  the 
arrival  at  T\  is  K^-fi-l).  It  can  then  be  shown  that  on  (T^T^min^^ .  "T} 


(5.2) 

Ti  *  %-ii-i)  * 

T  <  Ti  SO  T  f  T1 

Tf  ‘  SK,-(i-t)  * 

t  i  T^or  t  undefined. 

The  first  implication  is  Immediate;  the  second  is  true  because  (a) 
t  1  T.  by  definition  of  t;  (b)  r  /  since 


<  Tj  &  Tk  =  min 


*<k  Tt 


k  >  1 


*  Kk-(k-1)  >  K.-(i-l) 
so 

\-(k-l)  -  SKr(i-l)  >  Ti  >  Tk; 

and  (c)  t  (L  (Tj.Tj)  since  on  (T.  <  T1  <  min^^  T^)  there  is  no  k  for 
which  both  Tk  =  min^  and  Tk  €  (T^.Tj). 

From  (5.2)  it  follows  ironediately  that 


(5.3)  (t  =  Ti  &  T^<T^<min^<j<iTj }  = 

{Ti<T1<m1n1<j<1Tj  4  T1>sk1  4  T1<sK1-(i-l)}’ 
and,  from  the  independence  of  {T^}  and  {K^}, 

(5.4)  p(Ti<Ti<mini<j<iTj  4  T1>sk1  &  Ti<sKi-(i-l)^ 

=  J2  P(Ti<Tl<minl<j<1Tj)  kf1  Jk  p(Ki*k»Ki!,i“1+^)p(Z(i)<sa  4  fa 
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where  Z^j  and  denote  the  smallest  and  second  smallest  order 
statistics  from  a  random  sample  ly  ....  Z^,  I ID,  each  uniform  on 
(t,l).  Now  it  is  straightforward  to  evaluate  the  terms  on  the  right 
side  of  (5.4).  They  are 

(5.5)  p(Ti<T1<min1<j<iTi)  * 

(5.6)  P(K1=k,K.*i-l+0  *  P(K1=k,K.-K1=i-ln-k)  =  (1"^’k)t*‘1  (1-t)1 
and 

(5.7)  KZ(1)<Sa  &  Z(2)>s k)  * 

,  0  if  t  >  s  since  l,->\  >  t 

’  sl  (1 ) 

l-d-s^Vd-t)1  =  l-P(Zj>s£,j=l,  2,  ....  i)  if  sk  <  t  <  s^ 
d-tr^Ks^d-s^^Uo-s^’-n-s^1} 

=  P(exactly  one  of  Zj,  . ..,  ZR  €  (t,sk)  &  all  others  >  s^) 
+  p(sk  <  If  t  <  sk. 

Substituting  (5.5),  (5.6),  and  (5.7)  into  (5.4),  then  (5.4)  into 
(5.1)  using  the  identity  (5.3),  gives 

P(t*T1)  =  A(r)  +  B(r)  +  C(r)  +  D(r) 

where 
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aw  -  jr’o-1)-1  I  i  <1"i*2'k)tt'1n-t)1i  l 

i=2  k=l  A=r^l  1  L  tr-l} 


B(r)  «  -  l  tk-’  l  r'(i-l)-'  l 

k=r+l  1=1  1  £  k  ir~u 


C(r)  =  I  t*  1  J  1(1-1)  I  ( 
k=r+l  1-2  1=0 


(1-t) 


D(r)  =  -  [  t*-1  J  r’d-l)-1  J  t1’^)  (1-sJ1. 

k=r+l  1=2  *=?+l  1  c  * 


Now  from  the  identity 


.1  =  1 


1=2,  3,  . . .  j  0<u<l 


B,  C,  and  D  are  easily  simplified  to 

C(r)  =  l  tW(MJ 
k=r+l  K 


B(r)  +  D(r)  =  J  tK*'[hk_  ,(sk)-(l-s .)] 
k=r+l  K  1  K  K 

with  h|((s|c+i)  as  defined  in  (3.27).  Then 

B(r)  +  C(r)  +  D(r)  =  jf  tjh.(s.  ,)  , 

j=r  J  J 

so,  from  (3.26),  to  complete  the  proof  it  must  be  shown  that  A(r)  =  E(r) 


where 


E(r)  =  trC|ln  J''<t'J-l)I{r>2)]- 

This  Is  trivally  true  for  r  =  0  and  can  be  proved  by  Induction  on 
r  for  r  ^  1  by  verifying  the  following  equalities: 


E(r+1 )  =  t[E(r)  +  r-1  (l-t)r] 


r=1 ,  2,  . . . 


(5.8)  A(r+1)  =  t[A(r)  +  u{r)]  r=1,  2,  ... 

—  where 

u(r)  •  I  r'u-ir'  1 

i=2  j*r+l 

—  and 

u(l)  =  1-t 

u(r-l)  -  u(r)  =  (r-l)_1(1-t)  -  r_1(1-t)r  r=1,  2,  ...  . 

Hence 

u(r)  =  r_1(l-t)r  r=1»  2 . 

Substitute  this  into  (5.8)  to  conclude  that  A(r)  =  E(r)  and  complete 
the  proof.  □ 

It  should  be  noted  that  the  common  value  A(r)  =  E(r)  is  in  fact 
just  E  X  I{x>1}  where  X  is  the  number  of  failures  before  the  r-th 
success  in  independent  Bernoulli  trials  with  success  probability  t. 

This  can  easily  be  seen  by  verifying  that 

w  (r)  =  l  J'VT1)  tr(l-t)j 
M  3 

satisfies 

w(l)  »  A(1 )  «  E ( 1 ) 
and 

w(r+l)  ■  t[w(r)  +  r’^(l-t)r].  r=l,  2,  ... 

(See  Feller  [1968;  p.  241  problem  33  and  solution  on  p.  493]  for  a  related 
result.) 
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TABLE  1 

Parameters  of  Optimal  Policies  for  the  Infinite  Problem 


Lower  Bound 

Exact  Value 

Upper  Bound 

(l-k-1) 

<t  * 

sk 

(.5p 

0 

.36788 

.500 

.500 

.64200 

.707 

.667 

.75181 

.794 

.750 

.81018 

.841 

.800 

.84636 

.871 

.833 

.87098 

.891 

.857 

.88880 

.906 

.875 

.90230 

.917 

.889 

.91287 

.926 

.900 

.92139 

.933 

.909 

.92838 

.939 

.917 

.93424 

.944 

.923 

.93921 

.948 

.929 

.94348 

.952 

.933 

.94719 

.955 

.938 

.95044 

.958 

.941 

.95332 

.960 

.944 

.95588 

.962 

.947 

.95817 

.964 

.950 

.96024 

.966 

.952 

.96211 

.968 

.955 

.96382 

.969 

.957 

.96537 

.970 

.958 

.96680 

.972 

.960 

.96812 

.973 

i 

I 
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TABLE  3 

Optimal  Success  Probabilities,  and  Limits, 

p*(t),  for  Fixed  tsm/(m+n) 

min(m,n) 


2 

4 

6 

8 

10 

14 

20 

24 

30 

P*(t) 

,439 

.421 

.416 

.413 

.412 

.410 

.409 

.405 

.453 

.435 

.428 

.424 

.422 

.420 

.418 

.418 

.414 

.486 

.456 

.446 

.442 

.439 

.436 

.434 

.433 

.432 

.428 

.544 

.497 

.481 

.475 

.471 

.466 

.463 

.461 

.460 

.455 

.600 

.532 

.514 

.505 

.499 

.493 

.488 

.487 

.485 

.478 

.667 

.583 

.554 

.541 

.533 

.524 

.517 

.515 

.512 

.502 

.700 

.593 

.569 

.557 

.549 

.540 

.534 

.532 

.529 

.520 

.700 

.610 

.581 

.569 

.561 

.553 

.547 

.554 

.542 

.533 

.714 

.621 

.594 

.582 

.574 

.566 

.560 

.557 

.555 

.546 

.722 

.628 

.602 

.589 

.581 

.573 

.567 

.565 

.553 

.727 

.633 

.606 

.594 

.587 

.578 

.572 

.558 
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